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' We study the problem of bicoloring random hypergraphs, both numerically and analytically. 

We apply the zero-temperature cavity method to find analytical results for the phase transitions 
f^**) ■ (dynamic and static) in the 1RSB approximation. These points appear to be in agreement with 

£SJ ' the results of the numerical algorithm. In the second part, we implement and test the Survey 

, ' Propagation algorithm for specific bicoloring instances in the so called HARD-SAT phase. 

s : 

H) ■ 

\ I. INTRODUCTION 

(N ; 

The hypergraph bicoloring is one of the classic combinatorial optimization problems belonging to the iVP-complctc 
class 0. Its random version, bicoloring of random hypergraphs, is a very interesting problem for the phase transitions 
it shows. Indeed, varying the average connectivity of the random hypergraph, the model undergoes a transition Q 
from a phase in which all links can be properly colored to a phase in which a sizeble fraction of links are violated. 
* |— ) , Around the transition point most difficult instances accumulate. 

A graph is an ensemble of sites and links between them. In a hypergraph, the links connect triplets of sites. Each 
site (or vertex) can be colored in two ways, say black or white, so it is natural to identify it with an Ising spin variable 
that can assume the values 1 or —1. The link is considered to be satisfied if the three spins that share it are not all 
of the same color. In the following we will often refer to a link as a function node, as it is called for example in the 
K-SAT problem Q. The bicoloring problem consists in finding an assignment to all spins such that all the links are 
£h ' satisfied. Consequently a graph will be called colorable or uncolorable. 

We can write the Hamiltonian for the problem assuming that each unsatisfied link gives a positive energy and zero 
i otherwise. The total energy is proportional to the number of unsatisfied links: a colorable hypergraph will have a 

\ zero-energy ground state, while a non colorable one will have a positive-energy ground state, 
i The Hamiltonian for bicoloring a hypergraph Q reads 
> ■ 

G\ 1 ^ 1 + (Tj <Tj+ aiak + aja k 

vo ; ri= o ' ^ ' 

CH . {i,i,k}eg 
^ ' 

where <jj = ±1 are Ising variables (corresponding to the 2 available colors) and the sum runs over all the hyperedges 
CO , of Q. Note that a factor 2 has been introduces for computational convenicence [2|j. 

Each term in the above sum is equal to 2 if and only if all the spins in the same interaction are parallel, that is 
if all the vertices connected by a hyperedge have the same color. The Hamiltonian in Eq.Q thus counts twice the 
number of badly colored hyperedges. Perfect colorations correspond to zero-energy configurations. 

In the present work we focus on colorability of random hypergraphs with N vertices and M hyperedges, varying 
. the relevant parameter a = In a typical random hypergraph the connectivity of a spin (i.e. the degree of a vertex) 
<—j ' is a random variable distributed according to a Poissonian of mean 3a. 

Analogously to random K-SAT random K-XORSAT Q and Q-coloring of random graphs Q, the random 
O ■ hypergraph bicoloring is expected to undergo two phase transitions increasing a. The first one is called "dynamical 
transition" and is located at ad where solutions to the problem (perfect colorations) undergo a clustering phenomenon. 
At this point the complexity S, which counts the number of clusters of solutions, becomes non-zero. We remind that 
if M(E) is the number of states at energy E the complexity is defined by the relation Af(E) = exp Nil (a, E/N), so it 
is a function of a and of the energy density. In the region where the complexity becomes positive, on top of a great 
number of ground states there appear an even larger number of metastable states: the latter may trap and slow down 
linear-time coloring algorithms and local search randomized methods |fg. At present all known linear-time coloring 
algorithms stop converging for a values well below ad- 

The second transition takes place at ev c , where the ground-state energy becomes positive: for a < a c most of the 
hypergraphs are colorable, while for a > a c most of them are not. This transition is formally equivalent to the so 
called SAT/UNSAT transition of K-SAT [10 and K-XORSAT 4], and we will refer to it with this name, although 
it is also known as "COL/UNCOL" transition in the computer science literature. 

Known results on the SAT/UNSAT transition are only upper and lower bounds. The best upper bound for a c , 
found with rigorous calculation, is 2.409 8]. The best lower bound is 3/2 Q- In Ref.01 it is analyzed the more 
general problem of bicoloring random hypergraphs with p-spin hyperlinks. However for the p = 3 case the bounds 
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FIG. 1: Left: Average extensive energy for sizes TV = 20,30,40,50. The crossing point roughly localizes the SAT/UNSAT 
transition. Right: Average extensive energy as a function of the rescaled variable (a — a c )/V 1//2 . Data are represented with 
standard deviations. 
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FIG. 2: Fraction of colorable hypergraphs at TV = 20, 30, 40, 50. The finite-size corrections are in this case larger and then the 
crossing point is less clearly localized. 

are worse than the ones we mentioned above. Recent rigorous results on random spin models and random K-SAT (K 
even) |l]J, [12| have shown that the 1RSB results provide rigorous upper bounds to the phase transition point and we 
expect the same to be true in our case. 

II. NUMERICAL RESULTS 

We wrote a recursive Davis-Putnam algorithm |13| to color random finite-size hypergraph in order to localize the 
point a c , that will be calculated analytically in the next sections. Here we present the numerical results, whose 
uncertainties are very small thanks to the average of a large number of disorder realizations. In Fig. ^ (left) we show 
that the energy curves for different N cross at a c . Indeed for a < a c limjv^oo E — because all hypergraphs are 
colorable, while for a > a c E cx N and diverges for N — > oo. From Fig. we estimate a c ~ 2.1. All the curves can 
be nicely collapse when plotted versus (a — a^N 1 / 2 , see Fig. ^ (right). 

A second estimate of a c can be obtained from the curves of the probability of being colorable as a function of a 
(see Fig. 0). However here the crossing point is less clear because of larger finite-size corrections. 
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III. THE CAVITY REPLICA SYMMETRIC SOLUTION 



A. Self-consistency equations 

We now study the bicoloring problem with the cavity method at zero temperature 0, 0] . The simplest form of 
the zero-temperature cavity method is the Replica Symmetric (RS) approximation, in which we suppose the system 
to have a single state. The basic hypothesis of the cavity method is the lack of correlation between two randomly 
chosen spins, because of the local tree structure of the hypergraph. Thanks to these vanishing correlations, the energy 
of the system for fixed oo can be written as a function of the cavity fields hj and gj on the 2k neighbors of ctq 0] 

k k 

In the case of hypergraph bicoloring the function u and w are given by 

u(h 2 , ha) = 9(-h 2 )6(-h 3 ) - 9(h 2 )6(h 3 ) 

w(h2,h 3 ) = \h 2 \ + \h 3 \-\u(h 2 ,h 3 )\ [6> 

where 8(x) = 1 if x > and 6(x) — elsewhere. The u are integers and can assume the values 0, 1 or —1. Note that 
w = Y] \h\ — \u\ is a general relation for models with Ising type variables. 

In the thermodynamic limit, we can assume the probability distributions of cavity fields h and cavity biases u to 
have well defined limits, and write for them self-consistency equations 



Q(u)= I dPihijdPih^S^u-uihufa 



w r k (4) 

p (M = ^/3a(fc) / dQ( Ul )...dQ(u k )5[h-J2 u i) . 

/ n *> ; — i 



k=0 " i=l 



with 



/3„(*) = «e- 

As expected, these equations coincide with those obtained from a replica calculation in Ref. [l^ . 
Exploiting system symmetries one can always write 

Q{u) = c 5{u) + \b{u + 1) + 5(u - 1)] . (5) 

Analogously the distribution of cavity fields can be written as P(h) — Y^-oo Pi ^Q 1 ~ *)' wnere the coefficients pi are 
symmetric, i.e. pi — p~i- The self consistency equations can be then written in terms of po and Co as 

Jpo = e- 3 "( 1 - c °)/o(3«(l-co)) 

where Iq(x) is the zero-order modified Bessel function. c is the order parameter of the system and it satisfies the 
self-consistency equation 



1 - 



y/2(l ~ co) = e- 3Q(1 - Co) /o (3a(l - c )) . (7) 



For any a value a "paramagnetic" solution co = 1 exists, for which all the cavity fields are zero. For a > cxrs = 2.3336, 
there also exists a non-trivial "glassy" solution with cq < 1. 
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B. Energy density 



We now compute the RS energy density, following the notation already used in We must compute E(a) = 

AEi - 2aAE 3 where 



AE 3 = 



dP{h 1 )dP{h 2 )dP{h 3 ) ■ 

1 + (7l<7 2 + + 



mm 

(7 1 ,CT2 



h\(J\ - h 2 a 2 - h 3 a 3 + \hi\ + \h 2 \ + \h 3 \) 



= 2 J dP(h 1 )dP(h 2 )dP(h 3 )8(h 1 h 2 )8(h 2 h 3 ) = ±(l- Po f = V2(l-co)i 

00 „ / k k \ 

A^i = f^w / d ®( u ^ ■ ■ ■ E N - 1 E u *i 

k=0 J \i=l i=l / 

00 

= 3a(l - c ) - 2e~ 3Q(1 " Co) ^ r/ r (3a(l - c 



(8) 



(9) 



If we introduce the parameter A = 3a (1 — Co) which satisfies the equivalent of Eq.Q the total RS energy density 
can be written as follows 



E = A - 2e- A Y, "l^ 1 - e ~ A/ o(A)) 



(10) 



The expression ()1U[1 seems to be the same for the different models with Ising variables (like p-spin 01 1 K-SAT [l8| . 
etc.), the difference being only in the self-consistency equation for A, where a is multiplied by a different constant. 
For example the a us value for the present bicoloring model is twice the value it takes in the 3-spin model |17| . 



C. RS phase diagram 

If we plot the energy l|l(J|l versus a we see that the energy of the non trivial solution is negative for a < 2.5906. In 
the region 2.3336 < a < 2.5906 the RS solution is therefore non-physical, because the energy density of this problem 
must be positive by definition. In the RS approximation we have found a paramagnetic phase for a < 2.3336 and a 
glassy phase for a > 2.5906. This prediction is not correct, both quantitatively and qualitatively. The values of a 
where the transitions appear are not in agreement with numerical simulations, and there is a non-physical region. 



D. Instability of evanescent field in the paramagnetic region 



Before going to the 1RSB approximation, let us concentrate in this section on the RS paramagnetic region a < 
2.3336, in order to analyze the distribution of the so-called evanescent fields 19]. In the paramagnetic phase at zero 
temperature all the cavity fields hi are null, but considering the first order correction in temperature one can write 
hi = Th\ (whence the adjective evanescent). 

In terms of expectation values of spin variables, an evanescent field is the only one that can give a finite magnetization 
in the zero temperature limit: m = tanh(/3/i) — ► tanh(Zi'). On the contrary, in the 'strictly'-zero-temperature formalism 
that we use to study ground state energy, variables are either frozen, |m| = 1, or paramagnetic, m = 0, and we disregard 
any detailed information concerning the fluctuations of the local magnetizations of the unfrozen variables. The global 
probability distribution of the local magnetizations could in principle be non trivial, with some variable polarized (yet 
never frozen) in some preferential direction. 

There are two equivalent ways of obtaining such information on the distribution of magnetizations: The first consists 
in writing the iterative cavity equations for such magnetizations and next taking the average over the underlying 
random hyper-graph. The second simply consists in computing the RS cavity equations at finite temperature assuming 
appropriate scaling of the cavity fields. Taking hi — Th[ with h\ finite leads, in the (3 — ► 00 limit, to a distribution of 
evanescent fields which may describe non trivial expectations for the spins. 

Following the same steps which brought us to the RS self-consistency equations Q}, we can write analogously the 
self-consistency equations for the distributions of h\ = (3hi and u[ — f3ui in the j3 — > 00 limit. These equations look 
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identical those in Eq.Q), the only difference being the definition of the function u(h\, h 2 ), which now reads 

/ . _ tanh(fe / 1 ) +tanh(fe^) 
" h%) ~ tanh(/ii) tanh(/i' 2 ) - 3 " U J 

For very low a the only solution to the self-consistency equations is P(h') = 8(h'). At variance with respect to 
other problems like for instance 3-SAT |18| in which the low a phase is highly non trivial, the bicoloring problem 
is simple. As it happens in the Q-coloring Q and in the 3-spin problems pfEJj the very low a phase is a genuine 
paramagnet, with local fields concentrated around zero even at the first order in temperature. 

However the solution P(h') — S(h') and Q(v!) = 8(u') may become unstable at a certain value of a, that we 
call a s . In order to study the stability of this solution (in which local fields are uncorrelated independently of the 
local strucutre of the underlying hypcrgraph) it is enough to give an infinitesimal width to P(h') and check whether 
it increases or decreases under the iteration of Eq.Q. For very small values of h\ one can linearize the function 
u'(h'i, h' 2 ) — —(hi + h' 2 )/3 and obtain very simple relations among the variances of P(h') and Q(u') at two consecutive 
iterations (n and n + 1) 

((U') 2 )n+l = l((h'f)n , (12) 
((h'f) n+1 = M(u'f)n ■ (13) 

For a < a s — 3/2 the variances do not increase under iteration of the RS equations and the system is in a truly 
paramagnetic phase with all the magnetization identically zero. 

For a > a s , the presence of a broad distribution of first-order corrections h' suggests the presence of a full RSB 
spin-glass phase at finite temperature, produced by a "replicon" instability at a s . The finite-temperature phase 
transition at a s corresponds at T = to the onset of a non trivial organization of ground states, with non trivial 
magnetizations (unfrozen RSB scenario). We incidentally note that the value of a s coincides with the best lower 
bound available for a c . 

However, as soon as the dynamical transition is reached at a<j ~ 1.915 (see next section), the system looses memory 
of the unfrozen RSB phase. The non-evanescent fields, h = 0(1), arc the only ones relevant in determining the ground 
state energy. At the level of non vanishing fields, at ad we have a transition from RS to 1RSB. At this point, the 
analytically disconnected solution with vanishing fields disappears. The presence of full RSB is somehow accidental 
and we expect for higher number of colors to disappear completely (as it happens in graph coloring 



IV. THE CAVITY 1RSB SOLUTION 



A. Self-consistency equations: the distribution p(rj) 



In the previous section we have seen that the RS approximations produces a wrong solution. Here we study the 
system with a better approximation, the so-called "one step Replica Symmetry Breaking" . 

In this approximation the scenario is a bit more complex: at ay (< a c ) there is a clustering phenomenon so that 
the computation made in the RS case is only valid within each state (cluster) . It must be also considered the crossing 
between the energy of two states, for which we use the "reweighting parameter" fj, as in [Tlj . 

The 1RSB order parameter is a distribution of distributions, whose self-consistency equations are the following 



Q(u)- / dPi(hi)dP 2 (h 2 ) S(u-u(h u h 2 )) 



QIQ] = j DV[Pi]DV[P 2 ]S^ Q(u)-J 
V [P\ - f> a (fc) [HdQIQASW p (h)-^- ffldQi^e-^^-^^S^-^i) 

1 n A — 1 ™ " A 1 „- 1 



(14) 



(15) 



with 5^ being a functional delta, and Ak normalization coefficients. 

Thanks to the system symmetries the most general form for Q(u) is given by 



Q(u) = r)5(u) 



1-7? 



6(u+l) + S(u-l) 



(16) 



that is symmetric under u <-> — u and with u € {—1,0,1}. The heterogeneity of the random hypergraphs is now 
reflected in the very different values 77 may take: e.g. isolated plaquettes certainly have 77 = 1. Let us call p(rf) the 
probability distribution function of r\. The problem will be now studied in terms of p(r\), which completely determines 
the order parameter Q[Q]. 
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FIG. 3: Left: Probability distribution pirf) for a = 2.0 and a — 2.1. Note the trivial contribution in 1. Right: Average value 
of r/ versus a. This value is exactly 1 for a < ad — 1.915. 



B. 



oo limit 



Self-consistency equations (|14fl and l|15|) can be written as a single self-consistency equation for the distribution 
p(rj). In the /i — > oo limit it reads 



Pfa)=£/ 3 a(fc)£/3a(fc') 
fc=0 fe'=0 



k k' 




(17) 



]X_i?7-i. Ea. l|17|l can be solved by a population dynamics 



with the normalization coefficients A k 

algorithm. Starting from a population of ?ys randomly distributed in [0, 1] we then iterate the following steps: 

• take k elements and compute rf and Ak, where A: is a Poissonian number; 

• take fc' elements and compute rf and Ay , where k' is a Poissonian number too; 

• compute a new r] as 

1 

and insert it in the population eliminating another random rj. 

The asymptotic distribution p(r]) is plotted in figure [21 (left) for different values of a. For a > ad — 1.915 the 
distribution has both a trivial contribution in 1 and a non- trivial one in the [4; 1] region, while for a < it collapses 
into a single delta function in 1. 

In figure |3 (right) we plot the average value of 77 versus a, by which we immediately localize the dynamical phase 
transition at ad — 1.915. An identical curve has been calculated analytically in the more tractable case of the p-spin 
model 0. 




C. Complexity 

In the fi — > 00 limit the complexity is given by |l5j 



E = lim = lim <^ logA fc - 2a log[l - -(1 - - J-)] 



(18) 



where the averages are taken with respect to the Poissonian distribution of k and with respect to p(?/). 

The complexity curve is plotted in figure 21 we identify the critical point a c — 2.105 that corresponds to the 
SAT/UNSAT transition, as the point where the complexity vanishes. 
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FIG. 4: Phase diagram of random hypergraph bicoloring. 



D. Energy density and 1RSB phase diagram 



In order to evaluate free energy <E> we must generalize the computation to for finite values of /j. 
The self-consistency equation for general fi is 



oo oo r, k k 

k=0 fc'=0 i=l 1=1 



(19) 



where a k is the coefficient of the delta function in of the distribution P( k \h) computed by the convolution of k 
biases u, and A k is its normalization factor. To compute quickly the (h) we can use a recursive relation: 



P {k) (h)= J dQ k (u k )dP {k - 1 \g)5(h-g-u k )e-^ 



(Wk\+\g\-\g+v-k\) 



(20) 



The free energy is given by $ — <I>i — 2a$2 with 

1- 



log(A fe ) , 



$ 2 = - log (l - 1(1 -77)(1 -^i)(l- e -2M) 



(21) 



For a > a c , $ has a maximum at a finite value of fi: it means that the ground state has positive energy. Otherwise 
for a < a c $ is always negative, converging toward zero for fi — > oo: it corresponds to a zero-energy ground state. 
The energy density is calculated as 



^ J A fe ^ + ^i_i(l- r? )(l-^)(l_ e -2M) 



(22) 



As we did before, rather than computing the derivative of the A k , we can write a recursive equation for the probability 
distribution R^(h) = JLp( k )(h): 



RW(h) - J dQ k {u k ) dgiR^Hg) + (\h\ - \g\ - ^P^XgMh -g- u k ) e -^+^ 



\-W) 



(23) 



Injecting this calculation in the population dynamics algorithm provides directly the curve E{jj) = ^(/i$). The 
ground state energy is obtained as the point where E{ji) and < E>(^) coincide. 



8 



LU 




a 



FIG. 5: Energy density of random hypergraph bicoloring: comparison among finite-size numerical results and analytical 1RSB 
solution. 
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FIG. 6: Complexity versus energy at a = 2.5: note the non-physical upper branch. Along the physical lower branch at the 
threshold energy Eth = 0.062 the complexity is maximal, while it becomes zero at the ground state energy. 



The ground state energy density is compared to the numerical results in figure [SJ This curve must be considered a 
N — > oo limit of the finite N curves that we obtained numerically. 

Another interesting curve that we can compute is the complexity versus the energy, that we plot parametrically in 
jj, using E([i) and (see Fig. |SJ. The curve £ = fi(E — $) has two branches: the lower one is physical one and 

represents the true complexity |27| . 

The last quantity we display in Fig. 0] is E t h versus a, that is simply the maximum of E(fj,). 

Summarizing the 1RSB results we get the following scenario. 

There is a "paramagnetic" phase for a < ad — 1.915, where there are no metastable states and we conjecture the 
existence of linear algorithms for coloring the generic hypergraph. The cavity fields are zero, so the spins are not 
forced to be black or white. In the so-called HARD-SAT region ad < a < a c — 2.105 the generic hypergraph is still 
colorable, but the presence of many states makes the coloring procedure very difficult. In each ground state there 
is a core of spins for which there is a particular pattern of coloring: because of the existence of an exponentially 
larger number of metastable states, it is very difficult for local search algorithm to color the core in the right way. 
For a > ad the 1RSB approximation becomes less valid when high energy states are considered Most likely, the 
curve E t h would slightly change if a better approximation would be used. 

These 1RSB results are expected to be a very good approximation of the exact analytical solution, as it happens 
in the majority of similar combinatorial optimization problems. For the p-spin model 17] an exact solution has been 
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FIG. 7: Factor Graph representation of an energy minimization problem, 
found that is identical to the 1RSB one pLl2l|. 

V. SINGLE SAMPLE ANALYSIS AND THE SP ALGORITHM 

An innovative and useful reformulation of the cavity equations has been proposed in ref. 0. The self-consistency 
equations are used to study single random problem instances and allow to get a microscopic information about the 
behavior of the single spins in the stable and metastable states of given energy density. The method, called Survey 
Propagation (SP), is general and provides the core ingredient of a new efficient algorithm 3, 7, 22] for finding ground 
states within the glassy phase. Here we will apply and check SP for the bicoloring problem. This problem is half-way 
between the random K-SAT problem and the random K-XORSAT (or p-spin) problem. Since the SP algorithm does 
work for random K-SAT 0, but it does not seem to work for random K-XORSAT, we believe of primary importance 
to check its performances on the random hypergraph bicoloring problem. 

The iterative equations for the probability distributions of cavity fields that we have used in the previous sections 
to find the phase diagram were implementing at the same time a population dynamics process and an averaging over 
the random realizations. However, the equations can be easily iterated over specific realizations, that is avoiding 
the averaging step. In such a formulation the order parameter becomes the full list of the cavity fields over the 
entire graph. From the cavity fields one may determine the bias of each spin in all metastable states of given energy 
density and this information can be used for algorithmic purposes. The underlying hypothesis for the exactness of 
the single-sample formalism is the validity of the so called clustering condition within states: cavity fields should be 
uncorrelated within states and we expect this to be approximatively true thanks to the fact that the most numerous 
loops in the graph have a length that diverges as logN. 

In order to set up an appropriate formalism for the single sample analysis, we resort to the factor graph represen- 
tation [2^| of the bicoloring problem: variables are represented by N circular "variable nodes" labeled with letters 
k, ... whereas links (which carry the interaction energy) are represented by M square "function nodes" labeled by 
a, 6, c, ...(see Fig. 0. Function nodes have connectivity 3, variable nodes have a Poisson connectivity of average 3a 
and the overall graph is bipartite. The energy function can be trivially written as the sum over function nodes of 
their energies. 

Following ref. , we call "messages" the u terms which represent the contribution to the cavity fields coming from 
the different connected branches of the graph. In the message-passing language (typical of error correcting codes 
algorithms H^) one may describe the SP equations as follows. In the replica symmetric approximation, the messages 
arriving at a node are added up and then sent to a function node. Next, the function node transforms all input signals 
into a new message which is sent to the descendant variable node. At the 1RSB level, the messages along the links 
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FIG. 8: Iterative equations as message-passing procedure. 



of the factor graph are u-surveys of usual messages over the various possible states of the system at a given value of 
the energy (which is fixed by the rewcighting parameter /x). While the method is not restricted to zero temperature, 
at T — it assumes a particularly simple form because messages can take only few values, 3 in our case, and the 
u-survey are given by the probabilities of these values. The u-surveys are parametrized by 2 real numbers and the SP 
can be implemented easily. Each edge a — > j from a function node to a variable node j carries a u-survey Q a ^j(u). 
The algorithm finds these u-surveys and all the cavity fields Pi_> a (/i). Very schematically, the procedure works as 
follows. All the u-surveys Q a ^i{u) are initialized randomly. Next, function nodes are selected sequentially at random 
and the u-surveys are updated according the the equations: 



,. / k \ I k k 

/ dui...du k Q bl ^i(ui)...Q bk ^i{u k )8 I h - u a J exp I ^ u a \ - ^ 




(24) 



Q a ^i{u) = C a ^i j dgdhP j ^ a (g)P e ^ a (h)S (u - u { g, h)) (25) 

where the function u(g,h) is the one defined in Eq.J3J). In the above expressions, Ci^ a ,C a ^i are normalization 
constants and the labels hi identify the k neighboring function nodes different from a connected to site the variable 
node i (see Fig. |SJ) 

Parameterizing the u-surveys as 

Qa^i(u) = (1 - vUi - r)-^)5{u) + V +^S(u - 1) + v -^6(u + 1) (26) 

the above set of equations I|24I25|I define a non- linear map over the 77s j2^| . 

The process is iterated until convergence is reached and finally the stable set of u-surveys are used to compute the 
N local field {Pi(Hi)}) distributions and the free energy $(^). We have: 

Pi{H)=Cij Yl du a Q a ^{u a )5 Ih- wJexpLfl £ u a \ - £ K| (27) 

with Ci being the normalization constant and V(i) the set of function nodes connected to variable i. The free-energy 
reads 



(M N \ 

\a=l i=l j 



(28) 
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FIG. 9: Free energy 00) for different samples of size N = 10000 and a = 2.05, 2.1, 2.2. 
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In the above expressions, T^(a) identifies the set of variable nodes connected to the function node a and E a is its 
energy (i.e. the link energy).. 

The complexity S(/i) = d<fr(p)/d{l/ y) and the energy density e(//) = d(fi$>(ii))/d(i of states can also be estimated 
over single instances. Fig. 10 shows the free energy 4>(fi) of single graphs with N — 10000 vertexes as a function of /x 
for different values of the average connectivity a. Fig. (|10fl shows the ground state energies and threshold energies for 
single instances at different a. Similar data can be produced for the complexity. The agreement with the averaged 
calculations of the previous sections is indeed remarkable already for relatively small values of N (as it should be 
expected from the self-averaging property of the free-energy) . 

Once the information concerning the effective local fields acting on the single spin variables becomes available a 
decimation procedure for finding ground states can be easily implemented. We have done one such implementation 
for the fi — > oo case, with the scope of finding perfect colorings in the dynamical region just below a c . In this regime, 
the expression of the nonlinear map simplifies considerably. From eqs. (|24I25|I we find 
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FIG. 10: Ground state energy and threshold energy for a single sample of size N = 10000 at different connectivities. 
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The value of Va^j can be calculated by normalization. Other relevant quantities such as the biases of variables 
and the complexity also acquire a simple form. Upon defining the bias W i '° of a variable as the probability of 
picking up a cluster of ground states at random and find that variable frozen in some preferential direction, that is 
W+ = Prob(ff; > 0), Wf = Prob(i^ = 0), Wr = Prob(Jfj < 0), we have: 
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(32) 



For the complexity we have: 
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where 



S Q = log [] ( U Ua + 
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log n+ + ft: 




(36) 



With the list of the biases on hand, the following simple decimation procedure to find ground state configurations 
can been implemented: 

1 . {77} ^random 



(a) Iterate eqs . (I24IJ25D until a fixed {?/*} point is reached 

3. Compute the biases = Prob(i7, ; > 0) , Wf = Prob(Hi = 0), Wj~ = Prob(Hi < 0), following eq. 
(l27l) . 

4. For Bi = — W~ , Choose i such that \B^\ is maximum. 

5. IF \Bi\ <e for all i then STOP (paramagnetic state) and output the reduced sub-problem.. 

6. FIX (Ji = 1 if Bi > 0, (Tj = —1 otherwise. 

7. GOTO |U 

One should notice that along the decimation procedure some of the variable are fixed and therefore new types of 
links appear. The corresponding new function nodes will have an energy which is inherited by the 3-body interaction 
by fixing one of the variables. Once decimation has started, the bicoloring problems becomes a mixture of graph and 
hypergraph bicoloring. 

The behavior of the algorithm on sufficiently large (n > 10 3 ) random bicoloring instances is the following: 

• for low a (a < ay), the variables turn out to be all paramagnetic (zero bias). 

• in the dynamical region the biases are non-trivial and the decimation procedure fixes many variables leading to 
sub-problems which are paramagnetic and easily solved by a greedy heuristic. Very close to a c the decimation 
procedure may fail in finding solutions in the first run. In this region the algorithm can be improved in many 
ways, e.g. by a random restart or a backtrack or a different decimation strategy. In any case we can not exclude 
the existence of a threshold close to a c where the decimation procedure stops converging. 

For small N the structural "rare events" of the random hyper-graph, like links sharing more than one variable 
or other types of short loops, require an appropriate (in principle simple) modification of the SP iterations [24| . 
More in general, the presence of loops of different length scales may introduce correlations which may require further 
non-trivial generalization of the whole SP procedure. 



In this work we have given a very detailed description of the random hypergraph bicoloring problem, both on the 
average-case and on single samples. 

After having defined the statistical model corresponding to this problem, we have applied the cavity method to 
solve it: results in the RS and the 1RSB approximations have been presented. 

Increasing the connectivity a the model undergoes several phase transitions, which can be summarized as follows: 

• for a < a s the model is in a genuine paramagnetic phase, all the magnetizations are identically null; 

• at a = a s a "replicon" instability takes place, which manifests at finite temperature with the onset of spin-glass 
order (full RSB); 

• for a s < a < a.d the presence of a full RSB phase at finite temperatures is reflected in the ground states by 
finite values for the spin magnetizations; 
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• at a = ad a clustering transition takes place among the ground states. They split in an exponentially large 
number of clusters. Within each cluster a finite fraction of variables is completely frozen (backbone); 

• for ad < a < a c the model has a non-zero complexity and an exponentially large number of metastable states, 
which may block local-search algorithms. Although the very strong correlations among variables the ground 
state energy is still zero and the problem is colorable on average; 

• at a — a c the COL/UNCOL phase transition takes place; 

• for a > a c the ground state energy is positive and the problem can not be colored on average. 

In the second part of this work we have applied the Survey Propagation algorithm to problem instances taken from 
the HARD-COL region (ay < a < a c ), finding in polynomial time solutions to the problem. So we have verified that 
the SP algorithm works properly also for this model, which is harder than the 3-sat problem 7]. Indeed this model, 
at variance with K-SAT, has no local biases which could in principle be exploited by a smart algorithm. 

Next steps in this line of research will be to consider random hard combinatorial problems endowed with some non 
trivial local structure of the underlying graph. This constitutes a conceptual challange that will bring the algorithmic 
and anlytical tools developed for sparse graphs closer to what is found in the real-world version of the same class of 
models [25). 
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Local fields will turn out to be integer valued rather than fractional. 

For the unphysical one there is not still a precise interpretation |lq|. however it seems not to have any physical meaning. 
In the algorithmic formalism we need a more general parametrization of surveys with respect to the one used in the first 
sections. As we shall see, along the decimation process the symmtries of surveys are lost. 



